Figure 1: 161P2F10B SSH sequence of 182 nucleotides (SEQ. ID. No. 742) 



1 GATCACACAT TAGGTTATNG ACTTCAATAT 

61 TAAAACTGGG TCCATGTGCC AAGAAAGATA 

121 CCATGGTTGC CTCCTCCACA ATTTGTATTT 

181 TC 



TTTCAAATGG TTCAACTTCA GTCTTCTCTT 
GCCTCCATGC TCCTAAACTC ATTGTTATAA 
GATTTACTCC TAACAGCCAG CCACTGTTGA 
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Figure 2. 

Figure 2A. The cDNA (SEQ ID. NO. : 744) and amino acid sequence (SEQ ID. NO. : 743) of 
161P2F10B. The 3858 nucleotide sequence of 161P2F10B is shown. The codon for the start methionine is 
underlined. The open reading frame extends from nucleic acid 44-2671 including the stop codon. 



1 M E S T L T 

1 ctactttattctgataaaacaggtctatgcagctaccaggacaATGGAATCTACGTTGAC 
7LATEQPVKKNTLKKYKIACI 

6 1 TTTAGCAACGGAACAACCTGTTAAGAAGAACACTCTTAAGAAATATAAAATAGCTTGCAT 

27VLLALLVIMSLGLGLGLGLR 
121 TGTTCTTCTTGCTTTGCTGGTGATCATGTCACTTGGATTAGGCCTGGGGCTTGGACTCAG 

47 KLEKQGSCRKKCFDASFRGL 
181 GAAACTGGAAAAGCAAGGCAGCTGCAGGAAGAAGTGCTTTGATGCATCATTTAGAGGACT 

67ENCRCDVACKDRGDCCWDFE 
241 GGAGAACTGCCGGTGTGATGTGGCATGTAAAGACCGAGGTGATTGCTGCTGGGATTTTGA 

87DTCVESTRIWMCNKFRCGET 
301 AGACACCTGTGTGGAATCAACTCGAATATGGATGTGCAATAAATTTCGTTGTGGAGAGAC 
107 RLEASLCSCSDDCLQKKDCC 
361 CAGATTAGAGGCCAGCCTTTGCTCTTGTTCAGATGACTGTTTGCAGAAGAAAGATTGCTG 
127 ADYKSVCQGETSWLEENCDT 
421 TGCTGACTATAAGAGTGTTTGCCAAGGAGAAACCTCATGGCTGGAAGAAAACTGTGACAC 
147 AQQSQCPEGFDLPPVILFSM 
481 AGCCCAGCAGTCTCAGTGCCCAGAAGGGTTTGACCTGCCACCAGTTATCTTGTTTTCTAT 
167 DGFRAEYLYTWDTLMPNINK 
541 GGATGGATTTAGAGCTGAATATTTATACACATGGGATACTTTAATGCCAAATATCAATAA 
187 LKTC G I H S KYMRAMYPT KTF 
601 ^TG AAAACATGTGGA ATTCATTCAAAATAC^TGAGAGCTATGTATGGTACCAAAACCTT- 
207 PNHYTIVTGLYPESHGI IDN 
661 CCCAAATCATTACACCATTGTCACGGGCTTGTATCCAGAGTCACATGGCATCATTGACAA 
227 NMYDVNLNKNFSLSSKEQNN 
721 TAATATGTATGATGTAAATCTCAACAAGAATTTTTCACTTTCTTCAAAGGAACAAAATAA 
247 PAWWHGQPMWLTAMYQGLKA 
781 TCCAGCCTGGTGGCATGGGCAACCAATGTGGCTGACAGCAATGTATCAAGGTTTAAAAGC 
267 AT YFWPGSEVAINGS FPS IY 
841 CGCTACCTACTTTTGGCCCGGATCAGAAGTGGCTATAAATGGCTCCTTTCCTTCCATATA 
287 MPYNGSVPFEERISTLLKWL 
901 CATGCCTTACAACGGAAGTGTCCCATTTGAAGAGAGGATTTCTACACTGTTAAAATGGCT 
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307 DLPKAERPRFYTMYFEEPDS 

961 GGACCTGCCCAAAGCTGAAAGACCCAGGTTTTATACCATGTATTTTGAAGAACCTGATTC 

327 S GHAGG P V S ARV I KAL QV VD 

1021 CTCTGGACATGCAGGTGGACCAGTCAGTGCCAGAGTAATTAAAGCCTTACAGGTAGTAGA 

347 HAFGML .MEGLKQRNLHNCVN 

1081 TCATGCTTTTGGGATGTTGATGGAAGGCCTGAAGCAGCGGAATTTGCACAACTGTGTCAA 

367 I ILLADHGMDQTYCNKMEYM 

114 1 TATCATCCTTCTGGCTGACCATGGAATGGACCAGACTTATTGTAACAAGATGGAATACAT 

387 TDYFPRINFFYMYEGPAPRI 

1201 GACTGATTATTTTCCCAGAATAAACTTCTTCTACATGTACGAAGGGCCTGCCCCCCGCAT 

407 RAHNIPHDFFSFNSEE IVRN 

1261 C CG AGC TC AT AAT ATAC CTCATGACTTTTTT AGTTTT AATTCTGAGG AAATTGTTAGAAA 

427 LSCRKPDQHFKPYLTPDLPK 

1321 CCTCAGTTGCCGAAAACCTGATCAGCATTTCAAGCCCTATTTGACTCCTGATTTGCCAAA 

447 RLHYAKNVRIDKVHLFVDQQ 

1381 GCG ACTG CACTATG C C AAGAACGTCAG AATCGAC AAAGTTCATCTCTTTGTGGATCAACA 

467 WLAVRSKSNTNCGGGNHGYN 

1441 GTGGCTGGCTGTTAGGAGTAAATCAAATACAAATTGTGGAGGAGGCAACCATGGTTATAA 

487 NEFRSMEAIFLAHGPS FKEK 

1501 CAATGAGTTTAGGAGCATGGAGGCTATCTTTCTGGCACATGGACCCAGTTTTAAAGAGAA 

507 TEV EPFENIEVYNLMCDLLR 

1561 GACTGAAGTTGAACCATTTGAAAATATTGAAGTCTATAACCTAATGTGTGATCTTCTACG 

527 IQPAPNNGTHGSLNHLLKVP 

1621 CATTCAACCAGCACCAAACAATGGAACCCATGGTAGTTTAAACCATCTTCTGAAGGTGCC 

547 FYEPSHAEEVSKFSVCGFAN 

1681 TTTTTATGAGCCATCCCATGCAGAGGAGGTGTCAAAGTTTTCTGTTTGTGGCTTTGCTAA 

567 PLPTESLDCFCPHLQNSTQL 

174 1 TCCATTGCCCACAGAGTCTCTTGACTGTTTCTGCCCTCACCTACAAAATAGTACTCAGCT 

— 587 E — Q — V — N — Q — M — L N L TQEEITATVKV 

1801 GGAACAAGTGAATCAGATGCTAAATCTCACCCAAGAAGAAATAACAGCAACAGTGAAAGT 

607 NLPFGRPRVLQKNVDHCLLY 

1861 AAATTTGCCATTTGGGAGGCCTAGGGTACTGCAGAAGAACGTGGACCACTGTCTCCTTTA 

627 HREYVSGFGKAMRMPMWSSY 

1921 CCACAGGGAATATGTCAGTGGATTTGGAAAAGCTATGAGGATGCCCATGTGGAGTTCATA 

647 TVPQLGDTSPLPPTVPDCLR 

1981 CACAGTCCCCCAGTTGGGAGACACATCGCCTCTGCCTCCCACTGTCCCAGACTGTCTGCG 

667 ADVRVPPSESQKCS FYLADK 

2 041 GGCTGATGTCAGGGTTCCTCCTTCTGAGAGCCAAAAATGTTCCTTCTATTTAGCAGACAA 
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687 NITHGFLYPPASNRTSDSQY 
2101 GAATATCACCCACGGCTTCCTCTATCCTCCTGCCAGCAATAGAACATCAGATAGCCAATA 

707 DALITSNLVPMYEEFRKMWD 
2161 TGATGCTTTAATTACTAGCAATTTGGTACCTATGTATGAAGAATTCAGAAAAATGTGGGA 

727 YFHSVLLIKHATERNGVNVV 
2221 CTACTTCCACAGTGTTCTTCTTATAAAACATGCCACAGAAAGAAATGGAGTAAATGTGGT 

747 SG 'PI FDYNYDGHFDAPDEIT 
22 81 TAGTGGACCAATATTTGATTATAATTATGATGGCCATTTTGATGCTCCAGATGAAATTAC 

767 KHLANTDVPI PT HYFVVLTS 
2341 CAAACATTTAGCCAACACTGATGTTCCCATCCCAACACACTACTTTGTGGTGCTGACCAG 

787 CKNKSHTPENCPGWLDVLPF 
2401 TTGTAAAAACAAGAGCCACACACCGGAAAACTGCCCTGGGTGGCTGGATGTCCTACCCTT 

807 I IPHRPTNVESCPEGKPEAL 
2461 TATCATCCCTCACCGACCTACCAACGTGGAGAGCTGTCCTGAAGGTAAACCAGAAGCTCT 

827 WVEERFTAHIARVRDVELLT 
2521 TTGGGTTGAAGAAAGATTTACAGCTCACATTGCCCGGGTCCGTGATGTAGAACTTCTCAC 

847 GLDFYQDKVQPVSEILQLKT 
2581 TGGGCTTGACTTCTATCAGGATAAAGTGCAGCCTGTCTCTGAAATTTTGCAACTAAAGAC 

867 YLPTFETTI * 
2 641 ATATTTACCAACATTTGAAACCACTATTTAActtaataatgtctacttaatatataattt 
2 701 actgtataaagtaattttggcaaaatataagtgattttttctggagaattgtaaaataaa 
2 761 gttttctatttttccttaaaaaaaaaaccggaattccgggcttgggaggctgaggcagga 
2 821 gactcgcttgaacccgggaggcagaggttgcagtgagccaagattgcgccattgcactcc 
2 881 agagcctgggtgacagagcaagactacatctcaaaaaataaataaataaaataaaagtaa 

2 94 1 caataaaaataaaaagaacagcagagagaatgagcaaggagaaatgtcacaaactat tgc 
3001 aaaatactgttacactgggttggctctccaagaagatactggaatctcttcagccatttg 

3 061 cttttcagaagtagaaaccagcaaaccacctctaagcggagaacatacgattctttatta 
3121 agtagctctggggaaggaaagaataaaagttgatagctccc tgattggga aa aaatgcac 
3181 aattaataaagaatgaagatgaaagaaagcatgcttatgttgtaacacaaaaaaaattca 
3241 caaacgttggtggaaggaaaacagtatagaaaacattactttaactaaaagctggaaaaa 
3301 ttttcagttgggatgcgactgacaaaaagaacgggatttccaggcataaagttggcgtga 
3361 gctacagagggcaccatgtggctcagtggaagacccttcaagattcaaagttccatttga 
3421 cagagcaaaggcacttcgcaaggagaagggtttaaattatgggtccaaaagccaagtggt 
3481 aaagcgagcaatttgcagcataactgcttctcctagacagggctgagtgggcaaaatacg 
3 541 acagtacacacagtgactattagccactgccagaaacaggctgaacagccctgggagaca 
3601 agggaaggcaggtggtgggagttgttcatggagagaaaggagagttttagaaccagcaca 
3661 tccactggagatgctgggccaccagacccctcccagtcaataaagtctggtgcctcattt 
3721 gatctcagcctcatcatgaccctggagagaccctgataccatctgccagtccccgacagc 
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3781 ttaggcactccttgccatcaacctgaccccccgagtggttctccaggctccctgccccac 
3841 ccattcaggccggaattc 

Figure 2B: The cDNA (SEQ ID. NO. : 746) and amino acid sequence (SEQ ID. NO. : 745) of 
161P2F10B variant 1. The 3858 nucleotide sequence of 161P2F10B variant 1 is shown. The start methionine is 
underlined. The open reading frame extends from nucleic acid 44-2671 including the stop codon. 

1 M E S T L T 

1 ctactttattctgataaaacaggtctatgcagctaccaggacaATGGAATCTACGTTGAC 
7LATEQPVKKNTLKKYKIACI 

6 1 TTTAGCAACGGAACAACCTGTTAAGAAGAACACTCTTAAGAAATATAAAATAGCTTGCAT 

27VLLALLVIMSLGLGLGLGLR 
121 TGTTCTTCTTGCTTTGCTGGTGATCATGTCACTTGGATTAGGCCTGGGGCTTGGACTCAG 

47KLEKQGSCRKKCFDASFRGL 
181 GAAACTGGAAAAGCAAGGCAGCTGCAGGAAGAAGTGCTTTGATGCATCATTTAGAGGACT 

67 ENCRCDVACKDRGDCCWDFE 
241 GG AGAACTGC CGGTGTG ATGTGGCATGTAAAG AC CGAGGTGATTGCTGCTGGGATTTTGA 

87DTCVESTRIWMCNKFRCGET 
301 AGACACCTGTGTGGAATCAACTCGAATATGGATGTGCAATAAATTTCGTTGTGGAGAGAC 
107 RLEASLCSCSDDCLQRKDCC 
361 CAGATTAGAGGCCAGCCTTTGCTCTTGTTCAGATGACTGTTTGCAGAGGAAAGATTGCTG 
127 ADYKSVCQGETSWLEENCDT 
421 TGCTGACTATAAGAGTGTTTGCCAAGGAGAAACCTCATGGCTGGAAGAAAACTGTGACAC 
147 AQQSQCPEGFDLPPVILFSM 
4 81 AGCCCAGCAGTCTCAGTGCCCAGAAGGGTTTGACCTGCCACCAGTTATCTTGTTTTCTAT 
167 DGFR AEYLYTWDTLMPNINK 

541 GGATGGATTTAGAGCTGAATATTTATACACA TGGGATACTTTAATGCCAAATATCAATAA 

187 L K f ~~ C G T HSKYMRAMYPTKTF 
601 ACTGAAAACATGTGGAATTCATTCAAAATACATGAGAGCTATGTATCCTACCAAAACCTT 
207 PNHYTIVTGLYPESHGI IDN 
661 CCCAAATCATTACACCATTGTCACGGGCTTGTATCCAGAGTCACATGGCATCATTGACAA 
227 NMYDVNLNKNFSLSSKEQNN 
721 TAATATGTATGATGTAAATCTCAACAAGAATTTTTCACTTTCTTCAAAGGAACAAAATAA 
247 PAWWHGQPMWLTAMYQGLKA 
7 81 TCCAGCCTGGTGGCATGGGCAACCAATGTGGCTGACAGCAATGTATCAAGGTTTAAAAGC 
267 ATYFWPGSEVAINGSFPSIY 
841 CGCTACCTACTTTTGGCCCGGATCAGAAGTGGCTATAAATGGCTCCTTTCCTTCCATATA 
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287 MPYNGSVPFEERISTLLKWL 

901 CATGCCTTACAACGGAAGTGTCCCATTTGAAGAGAGGATTTCTACACTGTTAAAATGGCT 

307 DLPKAERPRFYTMYFEEPDS 

961 GGACCTGCCCAAAGCTGAAAGACCCAGGTTTTATACCATGTATTTTGAAGAACCTGATTC 

327 SGHAGGPVSARVIKALQV VD 
1021 CTCTGGACATGCAGGTGGACCAGTCAGTGCCAGAGTAATTAAAGCCTTACAGGTAGTAGA 

347 HAFGMLMEGLKQRNLHNCVN 
1081 TCATGCTTTTGGGATGTTGATGGAAGGCCTGAAGCAGCGGAATTTGCACAACTGTGTCAA 

367 I ILLADHGMDQTYCNKMEYM 
1141 TATC ATC CTTCTGGCTGAC C ATGGAATGGACC AGACTTATTGTAACAAGATGG AATACAT 

387 TDYFPRINFFYMYEGPAPRI 
12 01 GACTGATTATTTTCCCAGAATAAACTTCTTCTACATGTACGAAGGGCCTGCCCCCCGCAT 

407 RAHNIPHDFFSFNSEEIVRN 
1261 CCGAGCTCATAATATACCTCATGACTTTTTTAGTTTTAATTCTGAGGAAATTGTTAGAAA 

427 LSCRKPDQHFKPYLTPDLPK 
1321 CCTCAGTTGCCGAAAACCTGATCAGCATTTCAAGCCCTATTTGACTCCTGATTTGCCAAA 

447 RLHYAKNVRIDKVHLFVDQQ 
1381 GCGACTGCACTATGCCAAGAACGTCAGAATCGACAAAGTTCATCTCTTTGTGGATCAACA 

467 WLAVRS KSNTNCGGGNHGYN 
1441 GTGGCTGGCTGTTAGGAGTAAATCAAATACAAATTGTGGAGGAGGCAACCATGGTTATAA 

487 NEFRSMEAIFLAHGPSFKEK 
1501 CAATGAGTTTAGGAGCATGGAGGCTATCTTTCTGGCACATGGACCCAGTTTTAAAGAGAA 

507 TEVEPFENIEVYNLMCDLLR 
1561 GACTGAAGTTGAACCATTTGAAAATATTGAAGTCTATAACCTAATGTGTGATCTTCTACG 

527 IQPAPNNGTHGSLNHLLKVP 
1621 CATTCAACCAGCACCAAACAATGGAACCCATGGTAGTTTAAACCATCTTCTGAAGGTGCC 

547 FYEPSHAEEVSKFSVCGFAN 
1681 TTTTTATGAGCCATCCCATGCAGAGGAGGTGTCAAAGTTTTCTGTT TGTGGCTTTGCTAA 
-567 — P — L — P T E S L D C F C~P H L Q N S T Q L 
1741 TCCATTGCCCACAGAGTCTCTTGACTGTTTCTGCCCTCACCTACAAAATAGTACTCAGCT 

587 EQVNQMLNLTQEEITATVKV 
1801 GGAACAAGTGAATCAGATG CTAAATCTCAC C CAAG AAGAAATAAC AGC AACAGTGAAAGT 

607 NLPFGRPRVLQKNVDHCLLY 
1861 AAATTTGCC ATTTGGGAGG CCTAGGGTACTGCAGAAGAACGTGGAC CACTGTCTCCTTTA 

627 HREYVSGFGKAMRMPMWSSY 
192 1 C CACAGGGAATATGTCAGTGGATTTGGAAAAGCTATGAGGATGCC CATGTGGAGTTCATA 

647 TV PQLGDTSPLPPTVPDCLR 
1981 CACAGTCCCCCAGTTGGGAGACACATCGCCTCTGCCTCCCACTGTCCCAGACTGTCTGCG 
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667 ADVRVPPSESQKCSFYLADK 
2 041 GGCTGATGTCAGGGTTCCTCCTTCTGAGAGCCAAAAATGTTCCTTCTATTTAGCAGACAA 

687 NITHGFLYPPASNRTSDSQY 
2101 GAATATCACCCACGGCTTCCTCTATCCTCCTGCCAGCAATAGAACATCAGATAGCCAATA 

707 DALITSNLVPMYEEFRKMWD 
2161 TGATGCTTTAATTACTAGCAATTTGGTACCTATGTATGAAGAATTCAGAAAAATGTGGGA 

727 YFHSVLLI KHATERNGVNVV 
2221 CTACTTCCACAGTGTTCTTCTTATAAAACATGCCACAGAAAGAAATGGAGTAAATGTGGT 

747 SGPIFDYNYDGHFDAPDEIT 
2281 TAGTGGACCAATATTTGATTATAATTATGATGGCCATTTTGATGCTCCAGATGAAATTAC 

767 KHLANTDVP I PTHYFVVLTS 
2341 CAAACATTTAGCCAACACTGATGTTCCCATCCCAACACACTACTTTGTGGTGCTGACCAG 

787 CKNKSHTPENCPGWLDVLPF 
2401 TTGTAAAAACAAGAGCCACACACCGGAAAACTGCCCTGGGTGGCTGGATGTCCTACCCTT 

807 I IPHRPTNVESCPEGKPEAL 
24 61 TATCATCCCTCACCGACCTACCAACGTGGAGAGCTGTCCTGAAGGTAAACCAGAAGCTCT 

827 WVEERFTAHIARVRDVELLT 
2521 TTGGGTTGAAGAAAGATTTACAGCTCACATTGCCCGGGTCCGTGATGTAGAACTTCTCAC 

847 GLDFYQDKVQPVSEILQLKT 
2581 TGGGCTTGACTTCTATCAGGATAAAGTGCAGCCTGTCTCTGAAATTTTGCAACTAAAGAC 

867 YLPTFETTI * 
2641 ATATTTACCAACATTTGAAACCACTATTTAActtaataatgtctacttaatatataattt 
2 701 actgtataaagtaattttggcaaaatataagtgattttttctggagaattgtaaaataaa 
2761 gttttctatttttccttaaaaaaaaaaccggaattccgggcttgggaggctgaggcagga 
2821 gactcgcttgaacccgggaggcagaggttgcagtgagccaagattgcgccattgcactcc 
2881 agagcctgggtgacagagcaagactacatctcaaaaaataaataaataaaataaaagtaa 
2941 caataaaaataaaaagaacagcagagagaatgagcaaggagaaatgtcacaaactattgc 
3001 aaaatactgttacactgggttggctctccaagaagatact ggaatctcttcagccatttg 
3061 cttttcagaagtagaaaccagcaaaccacctctaagcggagaacatacgattctttatta 
3121 agtagctctggggaaggaaagaataaaagttgatagctccctgattgggaaaaaatgcac 
3181 aattaataaagaatgaagatgaaagaaagcatgcttatgttgtaacacaaaaaaaattca 
3241 caaacgttggtggaaggaaaacagtatagaaaacattactttaactaaaagctggaaaaa 
3301 ttttcagttgggatgcgactgacaaaaagaacgggatttccaggcataaagttggcgtga 
3361 gctacagagggcaccatgtggctcagtggaagacccttcaagattcaaagttccatttga 
3421 cagagcaaaggcacttcgcaaggagaagggtttaaattatgggtccaaaagccaagtggt 
34 81 aaagcgagcaatttgcagcataactgcttctcctagacagggctgagtgggcaaaatacg 
3541 acagtacacacagtgactattagccactgccagaaacaggctgaacagccctgggagaca 
3601 agggaaggcaggtggtgggagttgttcatggagagaaaggagagttttagaaccagcaca 
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3661 tccactggagatgctgggccaccagacccctcccagtcaataaagtctggtgcctcattt 
3721 gatctcagcctcatcatgaccctggagagaccctgataccatctgccagtccccgacagc 
3781 ttaggcactccttgccatcaacctgaccccccgagtggttctccaggctccctgccccac 
3841 ccattcaggccggaattc 
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Figure 3A. Amino acid sequence of 161P2F10B (SEQ ID. NO. : 747). The 161P2F10B protein has 875 amino 
acids. 



1 


MESTLTLATE 


QPVKKNTLKK 


YKIACIVLLA 


LLVIMSLGLG 


LGLGLRKLEK 


QGSCRKKCFD 


61 


ASFRGLENCR 


CDVACKDRGD 


CCWDFEDTCV 


ESTRIWMCNK 


FRCGETRLEA 


SLCSCSDDCL 


121 


QKKDCCADYK 


SVCQGETSWL 


EENCDTAQQS 


QCPEGFDLPP 


VILFSMDGFR 


AEYLYTWDTL 


181 


MPNINKLKTC 


GIHSKYMRAM 


YPTKTFPNHY 


TIVTGLYPES 


HGIIDNNMYD 


VNLNKNFSLS 


241 


SKEQNNPAWW 


HGQPMWLTAM 


YQGLKAATYF 


WPGSEVAING 


SFPSIYMPYN 


GSVPFEERIS 


301 


TLLKWLDLPK 


AERPRFYTMY 


FEEPDSSGHA 


GGPVSARVIK 


ALQWDHAFG 


MLMEGLKQRN 


361 


LHNCVNIILL 


ADHGMDQTYC 


NKMEYMTDYF 


PRINFFYMYE 


GPAPRIRAHN 


IPHDFFSFNS 


421 


EEIVRNLSCR 


KPDQHFKPYL 


TPDLPKRLHY 


AKNVRIDKVH 


LFVDQQWLAV 


RSKSNTNCGG 


481 


GNHGYNNEFR 


SMEAIFLAHG 


PSFKEKTEVE 


PFENIEVYNL 


MCDLLRIQPA 


PNNGTHGSLN 


541 


HLLKVPFYEP 


SHAEEVSKFS 


VCGFANPLPT 


ESLDCFCPHL 


QNSTQLEQVN 


QMLNLTQEEI 


601 


TATVKVNLPF 


GRPRVLQKNV 


DHCLLYHREY 


VSGFGKAMRM 


PMWSSYTVPQ 


LGDTSPLPPT 


661 


VPDCLRADVR 


VPPSESQKCS 


FYLADKNITH 


GFLYPPASNR 


TSDSQYDALI 


TSNLVPMYEE 


721 


FRKMWDYFHS 


VLLIKHATER 


NGVNWSGPI 


FDYNYDGHFD 


APDEITKHLA 


NTDVPIPTHY 


781 


FWLTSCKNK 


SHTPENCPGW 


LDVLPFIIPH 


RPTNVESCPE 


GKPEALWVEE 


RFTAHIARVR 


841 


DVELLTGLDF 


YQDKVQPVSE 


ILQLKTYLPT 


FETTI 







Figure 3B. Amino acid sequence of 161P2F10B variant 1 (SEQ ID. NO. : 748). The 161P2F10B variant 1 
protein has 875 amino acids. 

1 MESTLTLATE QPVKKNTLKK YKIACIVLLA LLVIMSLGLG LGLGLRKLEK QGSCRKKCFD 

61 ASFRGLENCR CDVACKDRGD CCWDFEDTCV ESTRIWMCNK FRCGETRLEA SLCSCSDDCL 

121 QRKDCCADYK SVCQGETSWL EENCDTAQQS QCPEGFDLPP VILFSMDGFR AEYLYTWDTL 

181 MPNINKLKTC GIHSKYMRAM YPTKTFPNHY TIVTGLYPES HGIIDNNMYD VNLNKNFSLS 

241 SKEQNNPAWW HGQPMWLTAM YQGLKAATYF WPGSEVAING SFPSIYMPYN GSVPFEERIS 

3 01 TLLKWLDLPK AERPRFYTMY FEEPDSSGHA GGPVSARVIK ALQWDHAFG MLMEGLKQRN 

361 LHNCVNIILL ApHGjTOQTXQ— NKMEYMTDYF— PR IN FFYMYE - G P APR I-RAHN— IPHDFFS FNS ~ 

421 EEIVRNLSCR KPDQHFKPYL TPDLPKRLHY AKNVRIDKVH LFVDQQWLAV RSKSNTNCGG 

481 GNHGYNNEFR SMEAIFLAHG PSFKEKTEVE PFENIEVYNL MCDLLRIQPA PNNGTHGSLN 

541 HLLKVPFYEP SHAEEVSKFS VCGFANPLPT ESLDCFCPHL QNSTQLEQVN QMLNLTQEEI 

601 TATVKVNLPF GRPRVLQKNV DHCLLYHREY VSGFGKAMRM PMWSSYTVPQ LGDTSPLPPT 

661 VPDCLRADVR VPPSESQKCS FYLADKNITH GFLYPPASNR TSDSQYDALI TSNLVPMYEE 

721 FRKMWDYFHS VLLIKHATER NGVNWSGPI FDYNYDGHFD APDEITKHLA NTDVPIPTHY 

781 FWLTSCKNK SHTPENCPGW LDVLPFIIPH RPTNVESCPE GKPEALWVEE RFTAHIARVR 

841 DVELLTGLDF YQDKVQPVSE ILQLKTYLPT FETTI 
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4C) Alignment of 161P2F10B (SEQ. ID. No. 749) and SNP variant 2 (SEQ. ID. No. 750) carrying a T to P 
mutation at position 874. 

MEAIFLAHGPSFKEKTEVEPFENIEVYNLMCDLLRIQPAPNNGTHGSLNHLLKVPFYEPS 551 
MEAIFLAHGPSFKEKTEVEPFENIEVYNLMCDLLRIQPAPNNGTHGSLNHLLKVPFYEPS 



HAEEVSKFSVCGFANPLPTESLDCFCPHLQNSTQLEQVNQMLNLTQEEITATVKVNLPFG 



RPRVLQKNVDHCLLYHREYVSGFGKAMRMPMWSSYTVPQLGDTSPLPPTVPDCLRADVRV 



PPSESQKCSFYLADKNITHGFLYPPASNRTSDSQYDALITSNLVPMYEEFRKMWDYFHSV 



LLIKHATERNGVNWSGPIFDYNYDGHFDAPDEITKHLANTDVPIPTHYFWLTSCKNKS 



HTPENCPGWLDVLPFIIPHRPTNVESCPEGKPEALWVEERFTAHIARVRDVELLTGLDFY 
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Figure 5: 161P2F10B Hydrophilicity profile 

(Hopp TP., Woods K.R., 1981. Proc. Natl. Acad. Sci. U.S.A. 78:3824-3828) 




Figure 6: 161P2F10B Hydropathicity Profile 

(Kyte J., Doolittle R.F., 1982. J. Mol. Biol. 157:105-132) 




Figure 7: 161P2F10B % Accessible Residues Profile 

(JaninJ., 1979. Nature 277:491-492) 




Figure 8: 161P2F10B Average Flexibility Profile 

(Bhaskaran R., Ponnuswamy P.K., 1988. 
Int. J. Pept. Protein Res. 32:242-255) 




Figure 9: 161P2F10B Beta-turn Profile 

(Deleage, G., Roux B. 1987. Protein Engineering 1:289-294) 
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Figure 19A (Seq. ID. No. 751) 
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MESTLTLATEQPVKKNTLKKYKIACIVLLALLVIMSLGLGLGLGLRKLEKQGSCRKKCFDASFRGLENCR 
ccceeeeecccccccchhhhhhhhhhhhhhhhhhhhhchhhhhhhhhhhccccccccccccccchhcccc 
CDVACKDRGDCCWDFEDTCVESTRIWMCNKFRCGETRLEASLCSCSDDCLQKKDCCADYKSVCQGETSWL 
ccccccccccccccccccchhhhhehhhhhccccccceeechccccccccccccccccccccccccccch 
EENCDTAQQSQCPEGFDLPPVILFSMDGFRAEYLYTWDTLMPNINKLKTCGIHSKYMRAMYPTKTFPNHY 
cccccccccccccccccccceeeeeccccchhhhhhhhhhhhchhhhccccchhhhheecccccccccce 
TIVTGLYPESHGIIDNNMYDVNLNKNFSLSSKEQNNPAWWHGQP^LTAMYQGLKAATYFWPGSEVAING 
eeeeeccccccchhcccccccccccceeecccccccccccccchhhhhhhhhcccceeeecccceeeecc 
SFPSIYMPYNGSVPFEERISTLLKWLDLPKAERPRFYTMYFEEPDSSGHAGGPVSARVIKALQWDHAFG 

MLMEGLKQRNLHNCVNIILLADHGMDQTYCNKMEYMTDYFPRINFFYMYEGPAPRIRAHNIPHDFFSFNS 
hhhhhhhhhhcccceeeeeeeccccccccccchhhhhhhccceeeeeecccccccccccccccchhhcch 
EE I VRNLS CRKPDQHFKP YLTPDLPKRLHYAKNVRIDKVHLFVDQQWLAVRS KSNTNCGGGNHG YNNEFR 
hhhhhhhcccccccccccccccccchhhhhhcccccchhhhhhhhhhheeeccccccccccccccchhhh 
SMEAIFLAHGPSFKEKTEVEPFENIEVYNLMCDLLRIQPAPNNGTHGSLNHLLKVPFYEPSHAEEVSKFS 
hhhhhhhhccccccccccccccchhhhhhhhhhhhhcccccccccccccchheeccccccccccchccee 
VCGFANPLPTESLDCFCPHLQNSTQLEQWQMLNLTQEEITATVKVNLPFGRPRVLQKNVDHCLLYHREY 
eeccccccccccchhhcccccccchhhhhhhhhhhchhhhhheeeeccccccccehcccccceeeehhhh 
VSGFGKAMRMPMWSSYTVPQLGDTSPLPPTVPDCLRADVRVPPSESQKCSFYLADKNITHGFLYPPASNR 
hhccccceeeccccccccccccccccccccccccccccccccccccccceeeeccccccccccccccccc 
TSDSQYDALITSNLVPMYEEFRKMWDYFHSVLLIKHATERNGVNWSGPIFDYNYDGHFDAPDEITKHI^ 
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c: random coil (31.31%) 

e: extended strand (11.31%) 

h: alpha helix (57.37%) 



Figure 19B 

TMpred outpui for unknown 
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Figure 19C 

TMHMM posterior probabilities for Sequence 
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